UMass at TREC 2002: Cross Language and Novelty Tracks
نویسندگان
چکیده
The University of Massachusetts participated in the cross-language and novelty tracks this year. The cross-language submission was characterized by combination of evidence to merge results from two different retrieval engines and a variety of different resources – stemmers, dictionaries, machine translation, and an acronym database. We found that proper names were extremely important in this year’s queries. For the novelty track, we applied variants of techniques that have been employed for other problems. In addition, we created additional training data by manually annotating 48 additional topics.
منابع مشابه
The University of Amsterdam at TREC 2002
We describe our participation in the TREC 2002 Novelty, Question answering, and Web tracks. We provide a detailed account of the ideas underlying our approaches to these tasks. All our runs used the FlexIR information retrieval system.
متن کاملSentence Level Information Patterns for Novelty Detection
SENTENCE LEVEL INFORMATION PATTERNS FOR NOVELTY DETECTION JULY 2006 XIAOYAN LI, B.E. TSINGHUA UNIVERSITY M.E., TSINGHUA UNIVERSITY Ph.D. UNIVERSITY OF MASSACHUSETTS AT AMHERST Directed by: Professor W. Bruce Croft The detection of new information in a document stream is an important component of many potential applications. In this thesis, a new novelty detection approach based on the identific...
متن کاملAn information-pattern-based approach to novelty detection
In this paper, a new novelty detection approach based on the identification of sentence level information patterns is proposed. First, ‘‘novelty’’ is redefined based on the proposed information patterns, and several different types of information patterns are given corresponding to different types of users’ information needs. Second, a thorough analysis of sentence level information patterns is...
متن کاملCross-Language Spoken Document Retrieval on the TREC SDR Collection
This paper presents preliminary experiments on crosslanguage spoken document retrieval (SDR) carried out on a benchmark assembled at ITC-irst. The benchmark is based on resources used in the last two spoken document retrieval tracks at the TREC conference, which are available on the Internet. They include automatic transcripts of American English broadcast news, short topics written in English,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002